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ACTION 



International application No. 

PCT/ US 98/H529 



International filing date (day/month/year) 

14/07/1998 



(Earliest) Priority Date (day/month/year) 

15/07/1997 



Applicant 



GENENCOR INTERNATIONAL, INC. et al. 



This International Search Report has been prepared by this International Searching Authority and is transmitted to the applicant 
according to Article 18. A copy is being transmitted to the international Bureau. 



This Internationa) Search Report consists of a total of _ 



sheets. 



[ [ ft is aJso accompanied by a copy of each prior art document cited in this report. 



1 . Q Certain claims were found unsearchable (see Box I). 

2. Unity of invention is lacking (see Box II). 



3. W\ The international application contains disclosure of a nucleotide and/or amino acid sequence listing and the 
international search was carried out on the basis of the sequence listing 

[ | filed with the international application. 

[)("| furnished by the applicant separately from the international application, 

| [ but not accompanied by a statement to the effect that it did not include 
matter going beyond the disclosure in the international application as filed. 

| | Transcribed by this Authority 



4. With regard to the title, the text is approved as submitted by the applicant. 

| | the text has been established by this Authority to read as follows: 



5. With regard to the abstract, 



[)(] the text is approved as submitted by the applicant. 

| | the text has been established, according to Rule 38.2(b), by this Authority as it appears in 
Box III. The applicant may, within one month from the date of mailing of this International 
Search Report, submit comments to this Authority. 



6. The figure of the drawings to be published with the abstract is: 

Figure No. - Q as suggested by the applicant. 

| | because the applicant failed to suggest a figure. 

| | because this figure better characterizes the invention. 



| [ None of the figures. 
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^International application No. 

INTERNATIONAL SEARCH REPORT J pcT/us 98/ 145 29 



Box I Ob sen/at i ns wher rtain claims w re found unsearchable (Continuation of item 1 of first sheet) 

This International Search Report has not been established in respect of certain claims under Article 17{2)(a) for the following reasons: 
1. Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely; 



I Claims Nos.: 

because they relate to parts of the international Application that do not comply with the prescribed requirements to such 
an extent that no meaningful Internationa! Search can be carried out, specifically: 



3. I I Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 



1 . I | As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
' ' searchable claims. 



2. | | As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



□ As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
covers only those claims for which fees were paid, specifically claims Nos.: 



4. X No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

Claim 1 completely, 4-17 partially. 



Remark on Protest | | The additional search fees were accompanied by the applicant's protest. 

| j No protest accompanied the payment of additional search fees. 



Form PCT/ISA/21 0 (continuation of first sheet (1 )) (July 1 992) 



International Application No. PCTV US 98/14529 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1 completely, 4-17 partially 

Expression vector compri sing nucleic acid encoding CP1, 
microorganisms having an inactivated gene for CP1, and 
optionally for CP2 and/or CP3, cleaning compositions 
containing CP1 and their use. 



2. Claims: 2 completely, 4-17 partially 

Expression vector comprising nucleic acid encoding CP2, 

microorganisms having an inactivated gene for CP2, and 

optionally for CP3, cleaning compositions containing CP2 and 
their use. 



3. Claims: 3 completely, 4-15 

Expression vector comprising nucleic acid encoding CP3, 
microorganisms having an inactivated gene for CP3, cleaning 
compositions containing CP3 and their use. 



INTERNATIONAL ifcAnun ncrun i , 

l^^fernationai Application >.w 

_P^T/US 98/14529 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 C12N15/57 C12N1/20 C12N1/21 C11D3/386 

//C12N9/56,C12N9/90,(C12N1/20,C12R1:07) 

According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (class if ication system followed by classification symbols) 

IPC 6 C12N CUD 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 0 Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



WO 89 10975 A (PUBLIC HEALTH RESEARCH INST 
OF) 16 November 1989 
see the whole document 



MARGOT P ET AL: "The gene of the 
N-acetylglucosaminidase, a Bacillus 
subtilis 168 cell wall hydrolase not 
involved in vegetative cell autolysis. 
MOLECULAR MICROBIOLOGY, (1994 MAY) 12 
535-45. JOURNAL CODE: MOM. ISSN: 
0950-382X., XP002084429 
ENGLAND: United Kingdom 
see figure 3 

-& EMBL/GENBANK DATABASES Accession no 
P39841, Sequence reference MANU_BASCU, 
01-February 1995 
"Mannose-6-phosphate isomerase" 
XP002084431 



1,13 



11 



(4) 



-/-■ 



Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



0 Special categories of cited documents : 

"A* document defining the general state of the art which is not 

considered to be of particular relevance 
"E" earlier document but published on or after the international 

filing date 

'L* document which may throw doubts on priority cfaim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

*0* document referring to an oral disclosure, use, exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority date claimed 



'T later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X" document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y* document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

document member of the same patent family 



Date of the actual completion of the international search 

16 November 1998 


Date of mailing of the international search report 

0 1 03. 99 


Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL - 2280 H V Rijswijk 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: (+31-70) 340-3016 


Authorized officer 

VAN DER SCHAAL C.A. 



Form FCT/ISA/210 (second sheet) (Jury 1992) 
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ATIONAL itAMon ncru, . , 
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C(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



SADAIE Y ET AL: "Nucleotide sequence and 
analysis of the phoB-rrnE-grbESL region of 
the Bacillus subtilis chromosome" 
MICROBIOLOGY, 

vol. 143, June 1997, pages 1861-1866, 

XP002084430 

see table 1 

-& EMBL/GENBANK DATABASES Accession no 
D88802, Sequence reference BSD802, 

22 April 1995 
"ydhs" 
XPOO2084432 

EP 0 369 817 A (BIOTEKNIKA INTERNATIONAL) 

23 May 1990 

see the whole document 

KUNST F ET AL: "The complete genome 
sequence of the Gram-positive bacterium 
Bacillus subtilis" 
NATURE, 

vol. 390, 20 November 1997, pages 249-256, 

XP002080813 

see tables 1,11.1.1 
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INTE^ATIONAL btAnon ntrv,, . 

InlHPition on patent f amity members 



national Appuuauwi. 
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Patent document 
cited in search report 



Publication 
date 



Patent family 
member(s) 



Publication 
date 



AU 


622916 6 


30-04-1992 


All 




A 
M 


29-11-1989 


OK 




A 

A 


05-02-1990 


EP 




A 

A 


30-05-1990 


FI 




A 

A 


04-01-1990 


JP 




T 

1 


14-02-1991 


PT 




A 

A 


30-11-1989 


AT 


10/ COD 


T 

1 


15-05-1996 


AU 


4476689 


A 


24-05-1990 


CA 


2003078 


A 


18-05-1990 


DE 


68926333 


D 


30-05-1996 


DE 


68926333 


T 


28-11-1996 


DK 


577989 


A 


19-05-1990 


JP 


3067582 


A 


22-03-1991 


US 


5589383 


A 


31-12-1996 


US 


5620880 


A 


15-04-1997 



W0 8910976 



16-11-1989 



EP 0369817 



23-05-1990 



Form PC7/ISA/210 (patent family annex) (July 1992) 



PATENT COOPERATION TREATY! 

PCT 

INTERNATIONAL PRELIMINARY EXAMINATION REPORT 
(PCT Article 36 and Rule 70) 



REC'O 1 0 MAY 1399 



WIPO 



PCT 



Applicant's or agenfs file reference 
GC381-PCT 


FOR FURTHER ACTION ^ cc Notification of Transmittal of International 

Preliminary Examination Report (Form PCT/IPEA/416) 


International application No. 
PCT/US98/14529 


International filing date (day/monih/year) 
14 JULY 1998 


Priority date (day /month/year) 
15 JULY 1997 


International Patent Classification (IPC) or national classification and IPC 
Please See Supplemental Sheet. 


Applicant 

GENENCOR INTERNATIONAL, INC. 



1. This international preliminary examination report has been prepared by this International Preliminary 
Examining Authority and is transmitted tQJhs^applicant according to Article 36. 

2. This REPORT consists of a total of sheets. 

I I This report is also accompanied by ANNEXES, i.e., sheets of the description, claims and/or drawings which have 
■ — ' been amended and are the basis for this report and/or sheets containing rectifications made before this Authority, 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of v — ^ sheets. 



This report contains indications relating to the following items: 
Basis of the report 
Priority 

Non-establishment of report with regard to novelty, inventive step or industrial applicability 



citations and explanations supporting such statement 
Certain documents cited 



I 


m 


II 


□ 


III 




IV 


□ 


V 


m 


VI 


□ 


vn 


□ 


vm 


□ 



Date of submission of the demand 
14 JANUARY 1999 


Date of completion of this report 
27 APRIL 1999 


Name and mailing address of the 1PEA/US 

Commissioner of Patents and Trademarks 

Box PCT 

Washington, D.C. 20231 
Facsimile No. (703) 305-3230 


Authorized officer JOYCE BWOGERS 

PARALEGAL SPECIALIST 

EINAR STOLE (^^W 1 M ^S* 
Telephone No. (703) 308-0 196 ^/^^ 



Form PCT/IPEA/409 (cover sheet) (January 1994)* 



t 

f 
1 



1 



INTERNATIONAL PRELIMINARY EXAMINATION REPORT 



International application No. 
PCT/US98/14529 



L Ba.U f t h rep rt ^ ^ been lunushed to tte 

jx] the international application as originally filed. 

iifi as originally filed. 

[x] the description, pages _LA° * 



pages NONE 
pages NONE 
pages . 



, filed with the demand. 
, filed with the letter of — 
, filed with the letter of — 



the claims* 



Nos. M7_ 



Nos. NONE, 
Nos. _±4QME- 
Nos. NONE 
Nos. - 



, as originally filed. 
_ , as amended under Article 19. 

s filed with the demand. 

y filed with the letter of 

filed with the letter of 



m the drawings, sheets/fig Mi 

I J _1 *«/PLer MOT 



sheets/fis NONE, 



sheets^ NONE, 
sheets/fig 



, as originally filed. 
, filed with the demand. 
, filed with the letter of . 
, filed with the letter of 



have resulted in the cancellation of: 
NONE 



2. The amendments 

QjTJ the description, pages 

the claims, Nos. 

£j the drawings, sheets/% NONE 



NONE 



t been made , since they have been cornered 



^report has been es^ 



, 1 — I This report has been esiauu^ - - v~~ ,, -nuuloiiirintnl Bmi 

□ to go beyond the disclosure as filed, as Seated in the S^&M****^ 



4. Additional observations, if necessary: 
NONE 



Form PCT/IPEA/409 (Box 1) (January 1994)* 



INTERNATIONAL PRELIMINARY EXAMINATION REPORT 



International application No. 
PCT/US98/14529 



EQ. N n-eitablishment of opinion with regard to n velty, inventive step and industrial applicability 

The question whether the claimed invention appears to be novel, to involve an inventive step (to be non-obvious), or to be 
industrially applicabl have not been and will not be examined in respect of: 

[ | the entire international application. 

fxl claims Nos. 2 and 3 

because: 

□ the said international application, or the said claim Nos. . relate to the following subject matter which 
does not require international preliminary examination (specify). 



□ the description, claims or drawings (indicate particular elements below) or said claims Nos, . are so unclear 
that no meaningful opinion could be formed (specify). 




the claims, or said claims Nos. _ are so inadequately supported by the description that no meaningful 
opinion could be formed. 




no international search report has been established for said claims Nos, 2 and 3. 



Form PCT/IPEA/409 (Box III) (January 1994)* 



INTERNATIONAL PRELIMINARY EXAMINATION REPORT 



International application No. 
PCT/US98/14529 



V. Reasoned statement under Article 35(2) with regard to n velty, inventive step or industrial applicability; 
citations and explanations supporting such statement 



1. STATEMENT 
Novelty (N) 

Inventive Step (IS) 
Industrial Applicability (IA) 



Claims 1 and 4" 17 



Claims NONE 



Claims 1 and 4-17 



Claims NONE 



Claims 1 and 4-17 



Claims NONE 



YES 
NO 

YES 
NO 



YES 
NO 



2. CITATIONS AND EXPLANATIONS 

Claims 1 and 4-17 meet the criteria set out in PCT Article 33(2X 4 ). because the prior art does not teach or fairly suggest: 
1) gram-positive microorganisms having a mutation or deletion of genes encoding CP1, CP2, or CP3 resulting in the 
inactivation of the proteolytic activity of the CPU CP2 or CP3 proteins; 2) cleaning compositions comprising one or more of 
the proteins CP1, CP2, and CP3, and 3) vectors comprising the DNA encoding the CP1, CP2, or CP3 proteins, and 
transformed host cells and expression systems thereof. 



NEW CITATIONS 

NONE 



Form PCT/IPEA/409 (Box V) (January 1994)* 
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INTERNATIONAL PRELIMINARY EXAMINATION REPORT 



International application No. 
PCT/US98/I4529 



Supplemental Box 

(To be used when the space in any of the preceding boxes is not sufficient) 



Continuation of: Boxes I - VIII Sheet 10 



CLASSIFICATION: 

The International Patent Classification (IPC) and/or the National classification are as listed below: 
IPC(6): C12N 1/20, 1/21, 9/56, 9/90, 15/57; C12R 1/07; CUD 3/386 and US CI.: 435/212, 219, 222, 252.1, 252.3, 
252.31, 320.1 



Form PCT/IPEA/409 (Supplemental Box) (January 1994)* 



09/462846 

PATENT COOPERATION TREATY 



From the - _*_.__.. 

INTERNATIONAL PRELIMINARY EXAMINING AUTHORITY 



To: DEBRA J. GLAISTER 

GENENCOR INTERNATIONAL* INC. ^ - 
925 PAGE MILL ROAD 

PALO ALTO, CA 94304-1013 /S /\ 

Av ;..» 


PCT 

NOTIFICATION OF TRANSMITTAL OF 
INTERNATIONAL PRELIMINARY '- 
EXAMINATION REPORT 

< (PCT Rule 71.1) "^6-" 
]£ n 


Date of Mailing 

(day/month/year) Q g jyj^y ^qqq 


Applicant's or agent* s file reference 
GC381-PCT 


IMPORTANT NOTIFICATION 


International application No. International filing date (day /month/year) Priority Date (day /month/year) 
PCT/US98/14529 14 JULY 1998 15 JULY 1997 


Applicant ^ 
GENENCOR INTERNATIONAL, INC. 



] . The applicant is hereby notified that this International Preliminary Examining Authority transmits herewith the 
international preliminary examination report and its annexes, if any, established on the international application. 



2. A copy of the report and its annexes, if any, is being transmitted to the International Bureau for communication 
to all the elected Offices. 



3. Where required by any of the elected Offices, the International Bureau will prepare an English translation of 
the report (but not of any annexes) and will transmit such translation to those Offices. 



4. REMINDER 

The applicant must enter the national phase before each elected Office by performing certain acts (filing 
translations and paying national fees) within 30 months from the priority date (or later in some Offices)(Article 
39(l))(see also the reminder sent by the International Bureau with Form PCT/IB/301). 

Where a translation of the international application must be furnished to an elected Office, that translation must 
contain a translation of any annexes to the international preliminary examination report. It is the applicant's 
responsibility to prepare and furnish such translation directly to each elected Office concerned. 

For further details on the applicable time limits and requirements of the elected Offices, see Volume II of the 
PCT Applicant's Guide. 





Name and mailing address of the 1PEA/US 

Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C. 20231 
Facsimile No. (703) 305-3230 


Authorized officer *«* 

PARALEGAL SPECIALIST 
EINAR STOLE CHEMICAL MATRIX 

Telephone No. (703) 308-0196 Ot?**^ 



Form PCT/1PEA/416 (July 1992)* 



PATENT COOPERATION TREATY 

PCT 

INTERNATIONAL PRELIMINARY EXAMINATION REPORT 
(PCT Article 36 and Rule 70) 



Applicant's or agent's file reference 
GC381-PCT 


FOR FURTHER ACTION See Notification of Transmittal of International 

Preliminary Examination Report (Form PCT/IPEA/416) 


International application No. 
PCT/US98/14529 


International filing date (day/month/year) 
14 JULY 1998 


Priority date (day /month/year) 
15 JULY 1997 


International Patent Classification (IPC) or national classification and IPC 
Please See Supplemental Sheet. 


Applicant 

GENENCOR INTERNATIONAL, INC. 



1. This international preliminary examination report has been prepared by this International Preliminary 
Examining Authority and is transmitted tp_lhe_ a Ppli cant according to Article 36. 

2. This REPORT consists of a total of ^""^ sheets. 

| I This report is also accompanied by ANNEXES, i.e., sheets of the description, claims and/or drawings which have 
' — ' been amended and are the basis for this report and/or sheets containing rectifications made before this Authority, 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of v — ^ sheets. 



This report contains indications relating to the following items: 
Basis of the report 



I 


E 


II 


□ 


III 


LI 


IV 


□ 


V 




VI 


□ 


vn 


□ 


vra 


□ 



citations and explanations supporting such statement 
Certain documents cited 



Date of submission of the demand 
14 JANUARY 1999 


Date of completion of this report 
27 APRIL 1999 


Name and mailing address of the I PEA /US 

Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C 20231 
Facsimile No. (703) 305-3230 


Authorized officer JOYCE BWDGERS 

PARALEGAL SPECIALIST 

EINAR STOLE /^gAL MM*J* 
Telephone No. (703) 308-0196 ^^^^ f^*^ 



Form PCT/IPt, A/409 (cover sheet) (January 1994)* 



INTERNATIONAL PRELIMINARY EXAMINATION REPORT 



International application No. 
PCT/US98/I4529 



L Basis f the report 



1. This report has been drawn oo the basis of (Substitute sheets vJuch have been famished to the receiving Office at response to an imitation 
wxter Article 14 are referred to in this report as "originalfy fled* and are not annexed to the report since they do not contain amenato 



| x) the international application as originally filed. 
fx] the description, pages 1-16 



pages NONE 
pages NONE 
pages 



| x| the claims, Nos. 



-17 



Nos. NONE 



Nos. NONE 



Nos. NONE 
Nos. 



| the drawings, sheets/fig 1-11 



sheets/6g NONE 



sheets*^ NONE 
sheets/fig 



, as originally filed. 
, filed with the demand. 
, filed with the letter of . 
, filed with the letter of . 



, as originally filed. 

, as amended under Article 19. 

, filed with the demand. 

, filed with the letter of 

, filed with the letter of 



, as originally filed. 
, filed with the demand. 
, filed with the letter of . 
, filed with the letter of . 



2. The amendments have resulted in the cancellation of: 

| x| the description, pages NONE 

[7] the claims, Nos. NQNE 



|~x] the drawings, sheets/fig NONE 



3. I | This report has been established as if (some of) the amendments had not been made, since they have been considered 
— to go beyond the disclosure as filed, as indicated in the Supplemental Box Additional observations below (Rule 70.2(c)). 



4. Additional observations, if necessary: 
NONE 



Form PCT/I PEA/409 (Box I) (January 1994)* 



INTERNATIONAL PRELIMINARY EXAMINATION REPORT 



Internationa! application No. 
PCT/US98/14529 



m. Non-establishment f opini n with regard to nov lry f inv ntiv step and industrial applicability 

The question whether the claimed invention appears to be novel, to involve an inventive step (to be non-obvious), r to be 
industrially applicable have not been and will not be xamined in respect of: 

[ | the entire international application. 

| X| claims Nos. 2 and 3 



because: 

□ 



□ 



the said international application, or the said claim Nos. . relate to the following subject matter which 
does not require international preliminary examination (specify). 



the description, claims or drawings (indicate particular elements below) or said claims Nos. _ are so unclear 
that no meaningful opinion could be formed (specify). 



□ the claims, or said claims Nos. _ are so inadequately supported by the description that no meaningful 
opinion could be formed. 

no international search report has been established for said claims Nos. 2 and 3 . 



Form PCT/IPEA/409 (Box III) (January 1994)* 



International application No. 
PCT/US98/14529 



V. Reasoned statement under Article 35(2) with regard to n velty, inventive step r industrial applicability; 
citations and explanations supp rting such statement 



1. STATEMENT 

Novelty (N) Claims 1 and 4-17 YES 

Claims NONE NO 

Inventive Step (IS) Claims 1 and 4-17 YES 

Claims NONE NO 

Industrial Applicability (IA) Claims 1 4-17 YES 

Claims NONE NO 



2. CITATIONS AND EXPLANATIONS 

Claims 1 and 4*17 meet the criteria set out in PCT Article 33(2 X 4 X because the prior art does not teach or fairly suggest 
1) gram-positive microorganisms having a mutation or deletion of genes encoding CPl. CP2, or CP3 resulting in the 
inacu'van'on of the proteolytic activity of the CPl, CP2 or CP3 proteins; 2) cleaning compositions comprising one or more of 
the proteins CPl, CP2, and CP3, and 3) vectors comprising the DNA encoding the CPl, CP2, or CP3 proteins, and 
transformed host cells and expression systems thereof. 

NEW CITATIONS 

NONE 
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International application No. 
PCT/US98/14529 



Supplemental Box 

(To be used when the space in any of the preceding boxes is not sufficient) 



Continuation of: Boxes I - VIII 



Sheet 10 



CLASSIFICATION : 

The International Patent Classification (IPC) and/or the National classification are as listed below: 
IPC(6): C12N 1/20, 1/21, 9/56. 9/90. 15/57; C12R 1/07; CI ID 3/386 and US CI.: 435/212,219,222,252.1,252.3, 
252.31, 320.1 



Form PCT/1PEA/409 (Supplemental Box) (January 1994)* 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 
C12N 15/57, 1/20, 1/21, CUD 3/386 



A2 



(11) International Publication Number: WO 99/04016 

(43) International Publication Date: 28 January 1999 (28.01.99) 



(21) International Application Number: PCT/US98/ 1 4529 

(22) International Filing Date: 14 July 1998 (14.07.98) 



(30) Priority Data: 
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The present invention relates to the identification of novel cysteine proteases in Gram-positive microorganisms. The present invention 
provides the nucleic acid and amino acid sequences for the Bacillus subtilis cysteine proteases CP1, CP2 and CP3. The present invention 
also provides host cells having a mutation or deletion of part or all of the gene encoding CP1, CP2 or CP3. The present invention also 
provides host cells further comprising nucleic acid encoding desired heterologous proteins such as enzymes. The present invention also 
provides a cleaning composition comprising a cysteine protease of the present invention. 
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PROTEASES FROM "GRANT-POSITIVE ORGANISMS . 

FIELD OF THE INVENTION 

The present invention relates to cysteine proteases derived from gram- 
positive microorganisms. The present invention provides nucleic acid and amino acid 
sequences of cysteine protease 1 , 2 and 3 identified in Bacillus. The present invention also 
provides methods for the production of cysteine protease 1, 2 and 3 in host ceils as well as 
the production of heterologous proteins in a host cell having a mutation or deletion of part or 
all of at least one of the cysteine proteases of the present invention. 

BACKGROUND OF THE INVENTION 

Gram-positive microorganisms, such as members of the group Bacillus, have been 
used for large-scale industrial fermentation due, in part, to their ability to secrete their 
fermentation products into the culture media. In gram-positive bacteria, secreted proteins 
are exported across a cell membrane and a cell wall, and then are subsequently released 
into the external media usually maintaining their native conformation. 

Various gram-positive microorganisms are known to secrete extracellular and/or 
intracellular protease at some stage in their life cycles. Many proteases are produced in 
large quantities for industrial purposes. A negative aspect of the presence of proteases in 
gram-positive organisms is their contribution to the overall degradation of secreted 
heterologous or foreign proteins. 

The classification of proteases found in microorganisms is based on their catalytic 
mechanism which results in four groups: the serine proteases; metalloproteases; cysteine 
proteases; and aspartic proteases. These categories can be distinguished by their 
sensitivity to various inhibitors. For example, the serine proteases are inhibited by 
phenylmethylsulfonylfluoride (PMSF) and diisopropyifluorophosphate (DIFP); the 
metalloproteases by chelating agents; the cysteine enzymes by iodoacetamide and heavy 
metals and the aspartic proteases by pepstatin. The serine proteases have alkaline pH 
optima, the metalloproteases are optimally active around neutrality, and the cysteine and 
aspartic enzymes have acidic pH optima ( Biotechnology Handbooks, Bacillus , vol. 2, edited 
by Harwood, 1989 Plenum Press, New York). 

The activity of cysteine protease depends on a catalytic dyad of cysteine and 
histidine with the order differing among families. The best known family of cysteine 
proteases is that of papain having catalytic residues Cys-25 and His-159. Cysteine 
proteases of the papain family catalyze the hydrolysis of peptide, amide, ester, thiol ester 
and thiono ester bonds. Naturally occurring inhibitors of cysteine proteases of the papain 
family are those of the cystatin family (Methods in Enzymology, vol. 244, Academic Press, 
Inc. 1994). 
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SUMMARY OF THE INVENTION ------ 

The present invention relates to the unexpected and surprising discovery of three 
heretofore unknown or unrecognized cysteine proteases found in Bacillus subtilis, 
designated herein as CP1, CP2 and CP3, having the nucleic acid and amino acid as shown 
in Figures 1A-1B, Figures 5A-5B and 6A-6B, respectively. The present invention is based, in 
part, upon the presence of the characteristic cysteine protease amino acid motif GXCWAF 
found in uncharacterised translated genomic nucleic acid sequences of Bacillus subtilis. 
The present invention is also based in part upon the structural relatedness that CP1 has 
with the cysteine protease papain specifically with respect to the location of the catalytic 
histidine/alanine and asparagine/serine residues and the structural relatedness that CP1 
has with CP2 and CP3. 

The present invention provides isolated polynucleotide and amino acid sequences 
for CP1, CP2 and CP3. Due to the degeneracy of the genetic code, the present invention 
encompasses any nucleic acid sequence that encodes the CP1, CP2 and CP3 amino acid 
sequence shown in the Figures. 

The present invention encompasses amino acid variations of B. subtilis CP1, CP2 
and CP3 amino acids disclosed herein that have proteolytic activity. B. subtilis CP1, CP2 
and CP3 t as well as proteolytically active amino acid variations thereof, have application in 
cleaning compositions. In one aspect of the present invention, CP1, CP2 or CP3 obtainable 
from a gram-positive microorganism is produced on an industrial fermentation scale in a 
microbial host expression system. In another aspect, isolated and purified recombinant 
CP1, CP2 or CP3 obtainable from a gram-positive microorganism is used in compositions of 
matter intended for cleaning purposes, such as detergents. Accordingly, the present 
invention provides a cleaning composition comprising at least one of CP1, CP2 and CP3 
obtainable from a gram-positive microorganism. The cysteine protease may be used alone 
in the cleaning composition or in combination with other enzymes and/or mediators or 
enhancers. 

The production of desired heterologous proteins or polypeptides in gram-positive 
microorganisms may be hindered by the presence of one or more proteases which degrade 
the produced heterologous protein or polypeptide. Therefore, the present invention also 
encompasses gram-positive microorganism having a mutation or deletion of part or all of the 
gene encoding CP1 and/or CP2 and/or CP3, which results in the inactivation of the CP1 
and/or CP2 and/or CP3 proteolytic activity, either aione or in combination with deletions or 
mutations in other proteases, such as apr, npr, epr, mpr for example, or other proteases 
known to those of skill in the art. In one embodiment of the present invention, the gram- 
positive organism is a member of the genus Bacillus. In another embodiment, the Bacillus 
is Bacillus subtilis. 

In another aspect, the gram-positive microorganism host having one or more 
deletions or mutations in a cysteine protease of the present invention is further genetically 
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engineered to producea desired protein. In one embodiment of the present invention, the 
desired protein is heterologous to the gram-positive host celt, in another embodiment, the 
desired protein is homologous to the host cell. The present invention encompasses a gram- 
positive host cell having a deletion or interruption of the naturally occurring nucleic acid 
encoding the homologous protein, such as a protease, and having nucleic acid encoding 
the homologous protein or a variant thereof re-introduced in a recombinant form. In another 
embodiment, the host cell produces the homologous protein. Accordingly, the present 
invention also provides methods and expression systems for reducing degradation of 
heterologous or homologous proteins produced in gram-positive microorganisms comprising 
the steps of obtaining a Bacillus host cell comprising nucleic acid encoding said 
heterologous protein wherein said host cell contains a mutation or deletion in at least one of 
the genes encoding cysteine protease 1, cysteine protease 2 and cysteine protease 3; and 
growing said Bacillus host cell under conditions suitable for the expression of said 
heterologous protein. The gram-positive microorganism may be normally sporulating or 
non-sporuiating. 

The present invention provides methods for detecting gram positive microorganism 
homologs of B. subtilis CP1, CP2 and CP3 that comprises hybridizing part or all of the 
nucleic acid encoding B. subtilis CP1, CP2 and CP3 with nucleic acid derived from gram- 
positive organisms, either of genomic or cDNA origin. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1A-1B shows the DNA (SEQ ID NO:1) and amino acid sequence for CP1 
(YJDE) (SEQ ID NO:2). 

Figure 2 shows an amino acid alignment with papain (SEQ ID NO:3) (accession 
number papa_carpa.p) with the cysteine protease CP1, designated YJDE. For Figures 2, 3 
and 4 /the motif GXCWAF has been marked along with the catalytic cysteine and the 
conserved catalytic histidine/alanine and asparagine/serine residues. 

Figure 3 shows amino acid alignment of CP1 (YJDE) (SEQ ID NO:2) with CP3 (PMI) 
(SEQ ID NO:5). 

Figure 4 shows the amino acid alignment of CP1 (YJDE) (SEQ ID NO:2) with CP2 

(YdhS). 

Figure 5A-5B shows the amino acid (SEQ ID NO:6) and nucleic acid sequence for 
CP2 (YdhS) (SEQ ID NO:7). 

Figure 6A-6B shows the amino acid (SEQ ID NO:4) and nucleic acid sequence for 
CP3 (PMI) (SEQ ID NO:5). 
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DETAILED DESCRIPTION OF THE" PREFERRED EMBODIMENTS 
Definitions 

As used herein, the genus Bacillus includes all members known to those of skill in 
the art, including but not limited to S. subtilis, B. licheniformis, B. lentus, B. brevis, B. 
stearothermophilus, B. alkalophilus, B. amyloliquefaciens, B. coagulans, B. ciculans, B. 
lautus and B. thuringiensis. 

The present invention relates to novel CP1, CP2 and CP3 from gram positive 
organisms. In a preferred embodiment, the gram-positive organisms is a Bacillus. In 
another preferred embodiment, the gram-positive organism is Bacillus subtilis. As used 
herein, "S. subtilis CP1, CP2 or CP3" refers to the amino acid sequences shown in Figures. 
Figures 1A-1B show the amino acid and nucleic acid seqeunce for CP1 (YJDE); Figures 5A- 
5B show the amino acid and nucleic acid sequence for CP2 (YDHS); and Figures 6A-6B 
show the amino acid and nucleic acid sequences for CP3 (PMI). The present invention 
encompasses amino acid variations of the amino acid sequences disclosed in Figures 1A- 
1B and 5A-5B and 6A-6B that have proteolytic activity. Such proteolytic amino acid variants 
can be used in cleaning compositions. 

As used herein, "nucleic acid" refers to a nucleotide or polynucleotide sequence, and 
fragments or portions thereof, and to DNA or RNA of genomic or synthetic origin which may 
be double-stranded or single-stranded, whether representing the sense or antisense strand. 
As used herein "amino acid" refers to peptide or protein sequences or portions thereof. A 
"polynucleotide homolog" as used herein refers to a gram-positive microorganism 
polynucleotide that has at least 80%, at least 90% and at least 95% identity to B. subtilis 
CP1 , CP2 or CP3, or which is capable of hybridizing to B. subtilis CP1 , CP2 or CP3 under 
conditions of high stringency and which encodes an amino acid sequence having cysteine 
protease activity. 

The terms "isolated" or "purified" as used herein refer to a nucleic acid or amino acid 
that is removed from at least one component with which it is naturally associated. 

As used herein, the term "heterologous protein" refers to a protein or polypeptide 
that does not naturally occur in a gram-positive host celt. Examples of heterologous 
proteins include enzymes such as hydrolases including proteases, cellulases, amylases, 
carbohydrases, and lipases; isomerases such as racemases, epimerases, tautomerases, or 
mutases; transferases, kinases and phophatases. The heterologous gene may encode 
therapeutically significant proteins or peptides, such as growth factors, cytokines, ligands, 
receptors and inhibitors, as well as vaccines and antibodies. The gene may encode 
commercially important industrial proteins or peptides, such as proteases, carbohydrases 
such as amylases and glucoamylases, cellulases, oxidases and lipases. The gene of 
interest may be a naturally occurring gene, a mutated gene or a synthetic gene. 

The term "homologous protein" refers to a protein or polypeptide native or naturally 
occurring in a gram-positive host cell. The invention includes host cells producing the 
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homologous protein via recombinant DNA technology." The present invention encompasses - 
a gram-positive host cell having a deletion or interruption of naturally occurring nucleic acid 
encoding the homologous protein, such as a protease, and having nucleic acid encoding 
the homologous protein, or a variant thereof, re-introduced in a recombinant form. In 
another embodiment, the host cell produces the homologous protein. 

As used herein, the term "overexpressing" when refering to the production of a 
protein in a host cell means that the protein is produced in greater amounts than its 
production in its naturally occurring environment. 

As used herein, the phrase "proteolytic activity" refers to a protein that is able to 
hydrolyze a peptide bond. Enzymes having proteolytic activity are described in Enzyme 
Nomenclature, 1992, edited Webb Academic Press, Inc. 

Detailed Description of the Preferred Embodiments 

The unexpected discovery of the cysteine proteases CP1, CP2 and CP3 in B.subtilis 
provides a basis for producing host cells, expression methods and systems which can be 
used to prevent the degradation of recombinantly produced heterologous proteins. In a 
preferred embodiment, the host celt is a gram-positive host cell that has a deletion or 
mutation in the naturally occurring cysteine protease said mutation resulting in deletion or 
inactivation of the production by the host cell of the proteolytic cysteine protease gene 
product. The host cell may additionally be genetically engineered to produced a desired 
protein or polypeptide. 

It may also be desired to genetically engineer host cells of any type to produce a 
gram-positive cysteine protease. Such host cells are used in large scale fermentation to 
produce large quantities of the cysteine protease which may be isolated or purified and 
used in cleaning products, such as detergents. 

I. Cysteine Protease Sequences 

The CP1, CP2 and CP3 polynucleotides having the sequences as shown in Figures 
1A-1B, 5A-5B and 6A-6B, respectively, encode the Bacillus subtilis cysteine proteases CP1, 
CP2 and CP3. As will be understood by the skilled artisan, due to the degeneracy of the 
genetic code, a variety of polynucleotides can encode the Bacillus subtilis CP1, CP2 and 
CP3. The present invention encompasses all such polynucleotides. - - 

The present invention encompasses CP1, CP2 and CP3 polynucleotide homologs 
encoding gram-positive microorganism cysteine proteases CP1, CP2 and CP3, respectively, 
which have at least 80%, or at least 90% or at least 95% identity to B.subtilis CP1, CP2 and 
CP3 as long as the homolog encodes a protein that has proteolytic activity. 

Gram-positive polynucleotide homologs of B.subtilis CP1, CP2 or CP3 may be obtained 
by standard procedures known in the art from, for example, cloned DNA (e.g., a DNA "library"), 
genomic DNA libraries, by chemical synthesis once identified, by cDNA cloning, or by the 
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cloning of genomic DNA, or fragments thereof, purified from a desired ceil.- (See, for example,. 
Sambrook etal. x 1989, Molecular Cloning, A Laboratory Manual, 2d Ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York; GSover, D.M. (ed.), 1985, DNA Cloning: A 
Practical Approach, MRL Press, Ltd., Oxford, U.K. Vol. I, II.) A preferred source is from 
genomic DNA. Nucleic acid sequences derived from genomic DNA may contain regulatory 
regions in addition to coding regions. Whatever the source, the isolated CP1, CP2 or CP3 
gene should be molecularly cloned into a suitable vector for propagation of the gene. 

In the molecular cloning of the gene from genomic DNA, DNA fragments are generated, 
some of which will encode the desired gene. The DNA may be cleaved at specific sites using 
various restriction enzymes. Alternatively, one may use DNAse in the presence of manganese 
to fragment the DNA, or the DNA can be physically sheared, as for example, by sonication. 
The linear DNA fragments can then be separated according to size by standard techniques, 
including but not limited to, agarose and polyacrylamide gel electrophoresis and column 
chromatography. 

Once the DNA fragments are generated, identification of the specific DNA fragment 
containing the CP1 , CP2 or CP3 may be accomplished in a number of ways. For example, 
a B.subtilis CP1, CP2 or CP3 gene of the present invention or its specific RNA, or a 
fragment thereof, such as a probe or primer, may be isolated and labeled and then used in 
hybridization assays to detect a gram-positive CP1, CP2 orCP3 gene. (Benton, W. and 
Davis, R., 1977, Science 196 :180; Grunstein, M. And Hogness, D., 1975, Proc. Natl. Acad. 
Sci. USA 72:3961). Those DNA fragments sharing substantia! sequence similarity to the 
probe will hybridize under stringent conditions. 

Accordingly, the present invention provides a method for the detection of gram- 
positive CP1, CP2 and CP3 polynucleotide homologs which comprises hybridizing part or all 
of a nucleic acid sequence of S. subtilts CP1 , CP2 and CP3 with gram-positive 
microorganism nucleic acid of either genomic or cDNA origin. 

Also included within the scope of the present invention are gram-positive 

microorganism polynucleotide sequences that are capable of hybridizing to the nucleotide 

sequence of B.subtilis CP1, CP2 or CP3 under conditions of intermediate to maximal 

stringency. Hybridization conditions are based on the melting temperature (Tm) of the 

nucleic acid binding complex, as taught in Berger and Kimmel (1987, Guide to Molecular 

Cloning Techniques. Methods in Enzymotoqy, Vol 152, Academic Press. San Diego CA) 

incorporated herein by reference, and confer a defined "stringency" as explained below. 
"Maximum stringency" 1 typically occurs at about Tm-5°C (5°C below the Tm of the 

probe); "high stringency" at about 5°C to 10°C below Tm; "intermediate stringency" at about 

10°C to 20°C below Tm; and "low stringency" at about 20°C to 25°C below Tm. As will be 

understood by those of skill in the art, a maximum stringency hybridization can be used to 
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identify or detect identicarpoiynucleotide-sequences-while an. intermediate or low stringency 
hybridization can be used to identify or detect polynucleotide sequence homologs. 

The term "hybridization" as used herein shall include "the process by which a strand 
of nucleic acid joins with a complementary strand through base pairing" (Coombs J (1994) 
Dictionary of Biotechnology, Stockton Press, New York NY). 

The process of amplification as carried out in polymerase chain reaction (PCR) 
technologies is described in Dieffenbach CW and GS Dveksler (1995, PCR Primer, a 
Laboratory Manual, Cold Spring Harbor Press, Plainview NY). A nucleic acid sequence of 
at least about 10 nucleotides and as many as about 60 nucleotides from S. subtilis CP1, 
CP2 or CP3 preferably about 12 to 30 nucleotides, and more preferably about 20-25 
nucleotides can be used as a probe or PCR primer. 

The S. subtilis amino acid sequences CP1, CP2 and CP3 (shown in Figures 2, 4 and 
3, respectively) were identified via a FASTA search of Bacillus subtilis genomic nucleic acid 
sequences. B. subtilis CP1 (YJDE) was identified by its structural homology to the cysteine 
protease papain having the sequence designated "papa_carpa.p". As shown in Figure 2, 
YJDE has the motif GXCWAF as well as the conserved catalytic residues His/Ala and 
Asn/Ser. CP2 (YdHS) and CP3 (PMI) were identified upon their structural homology to CP1 
(YJDE). The presence of GXCWAF as well as residues His/Ala and Asn/Ser is noted in 
Figures 3 and 4. CP3 (PMI) was previously characterized as a possible phosphomannose 
isomerase, (Noramata). There has been no previous characterization of CP3 as a cysteine 
protease. 

H. Expression Systems 

The present invention provides host cells, expression methods and systems for the 
enhanced production and secretion of desired heterologous or homologous proteins in 
gram-positive microorganisms. In one embodiment, a host cell is genetically engineered to 
have a deletion or mutation in the gene encoding a gram-positive CP1, CP2 orCP3 such 
that the respective activity is deleted, in another embodiment of the present invention, a 
gram-positive microorganism is genetically engineered to produce a cysteine protease of 
the present invention. 

Inactivation of a gram-positive cysteine protease in a host cell 
Producing an expression host cell incapable of producing the naturally occurring 
cysteine protease necessitates the replacement and/or inactivation of the naturally occurring 
gene from the genome of the host cell. In a preferred embodiment, the mutation is a non- 
reverting mutation. 

One method for mutating nucleic acid encoding a gram-positive cysteine protease is 
to clone the nucleic acid or part thereof, modify the nucleic acid by site directed 
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mutagenesis and reintroduce the mutated" nucleic acid'into the cell on a plasmid. By- 
homologous recombination, the mutated gene may be introduced into the chromosome. In 
the parent host cell, the result is that the naturally occurring nucleic acid and the mutated 
nucleic acid are located in tandem on the chromosome. After a second recombination, the 
modified sequence is left in the chromosome having thereby effectively introduced the 
mutation into the chromosomal gene for progeny of the parent host cell. 

Another method for inactivating the cysteine protease proteolytic activity is through 
deleting the chromosomal gene copy. In a preferred embodiment, the entire gene is 
deleted, the deletion occurring in such as way as to make reversion impossible. In another 
preferred embodiment, a partial deletion is produced, provided that the nucleic acid 
sequence left in the chromosome is too short for homologous recombination with a plasmid 
encoded cysteine protease gene. In another preferred embodiment, nucleic acid encoding 
the catalytic amino acid residues are deleted. 

Deletion of the naturally occurring gram-positive microorganism cysteine protease 
can be carried out as follows. A cysteine protease gene including its 5* and 3' regions is 
isolated and inserted into a cloning vector. The coding region of the cysteine protease gene 
is deleted form the vector in vitro, leaving behind a sufficient amount of the 5' and 3' 
flanking sequences to provide for homologous recombination with the naturally occurring 
gene in the parent host cell. The vector is then transformed into the gram-positive host cell. 
The vector integrates into the chromosome via homologous recombination in the flanking 
regions. This method leads to a gram-positive strain in which the protease gene has been 
deleted. 

The vector used in an integration method is preferably a plasmid. A selectable 
marker may be included to allow for ease of identification of desired recombinant 
microorgansims. Additionally, as will be appreciated by one of skill in the art, the vector is 
preferably one which can be selectively integrated into the chromosome. This can be 
achieved by introducing an inducible origin of replication, for example, a temperature 
sensitive origin into the piasmid. By growing the transformants at a temperature to which 
the origin of replication is sensitive, the replication function of the plasmid is inactivated, 
thereby providing a means for selection of chromosomal integrants. Integrants may be 
selected for growth at high temperatures in the presence of the selectable marker, such as 
an antibiotic. Integration mechanisms are described in WO 88/06623. 

Integration by the Campbell-type mechanism can take place in the 5' flanking region 
of the protease gene, resulting in a protease positive strain carrying the entire piasmid 
vector in the chromosome in the cysteine protease locus. Since illegitimate recombination 
will give different results it will be necessary to determine whether the complete gene has 
been deleted, such as through nucleic acid sequencing or restriction maps. 

Another method of inactivating the naturally occurring cysteine protease gene is to 
mutagenize the chromosomal gene copy by transforming a gram-positive microorganism 
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with ofigonucieotides which are mutagenic. Alternatively, the chromosomal cysteine- 
protease gene can be replaced with a mutant gene by homologous recombination. 

The present invention encompasses host cetis having deletions or mutations of a 
cysteine protease of the present invention as well as additional protease deletions or 
mutations, such as deletions or mutations in apr, npr, epr, mpr and others known to those of 
skill in the art. United States Patent 5,264,366 discloses Bacillus host cells having a 
deletion of apr and npr; United States Patent 5,585,253 discloses Bacillus host cells having 
a deletion of epr; Margot et a!., 1996, Microbiology 142: 3437-3444 disclose host cells 
having a deletion in wpr and EP patent 0369817 discloses Bacillus host ceils having a 
deletion of mpr. 

One assay for the detection of mutants involves growing the Bacillus host celt on 
medium containing a protease substrate and measuring the appearance or lack thereof, of 
a zone of clearing or halo around the colonies. Host cells which have an inactive protease 
will exhibit little or no halo around the colonies. 

HI, Production of Cysteine Protease 

For production of cysteine protease in a host cell, an expression vector comprising at 
least one copy of nucleic acid encoding a gram-positive microorganism CP1, CP2 or CP3, 
and preferably comprising multiple copies, is transformed into the host cell under conditions 
suitable for expression of the cysteine protease. In accordance with the present invention, 
polynucleotides which encode a gram-positive microorganism CP1, CP2 or CP3, or 
fragments thereof, or fusion proteins or polynucleotide homotog sequences that encode 
amino acid variants of B.subtilis CP1, CP2 or CP3, may be used to generate recombinant 
DNA molecules that direct their expression in host cells. In a preferred embodiment, the 
gram-positive host cell belongs to the genus Bacillus. In another preferred embodiment, the 
gram positive host cell is B. subtilis. 

As wilt be understood by those of skiIMn the art, it may be advantageous to produce 
polynucleotide sequences possessing non-naturaiiy occurring codons. Codons preferred by 
a particular gram-positive host cell (Murray E et al (1989) Nuc Acids Res 17:477-508) can 
be selected, for example, to increase the rate of expression or to produce recombinant RNA 
transcripts having desirable properties, such as a longer half-life, than transcripts produced 
from naturally occurring sequence. 

Altered CP1, CP2 or CP3 polynucleotide sequences which may be used in 
accordance with the invention include deletions, insertions or substitutions of different 
nucleotide residues resulting in a polynucleotide that encodes the same or a functionally 
equivalent CP1, CP2 or CP3 homolog, respectively. As used herein a "deletion" is defined 
as a change in either nucleotide or amino acid sequence in which one or more nucleotides 
or amino acid residues, respectively, are absent. 
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As used herein an "insertion" or "addition" is that change in a nucleotide or amino _ 
acid sequence which has resulted in the addition of one or more nucleotides or amino acid 
residues, respectively, as compared to the naturally occurring CP1, CP3 or CP3. 

As used herein "substitution" results from the replacement of one or more 
nucleotides or amino acids by different nucleotides or amino acids, respectively. 

The encoded protein may also show deletions, insertions or substitutions of amino 
acid residues which produce a silent change and result in a functionally CP1, CP2 or CP3 
variant. Deliberate amino acid substitutions may be made on the basis of similarity in 
polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of 
the residues as long as the variant retains the ability to modulate secretion. For example, 
negatively charged amino acids include aspartic acid and glutamic acid; positively charged 
amino acids include lysine and arginine; and amino acids with uncharged polar head groups 
having similar hydrophilicity values include leucine, isoleucine, valine; glycine, alanine; 
asparagine, glutamine; serine, threonine, phenylalanine, and tyrosine. 

The CP1, CP2 or CP3 polynucleotides of the present invention may be engineered 
in order to modify the cloning, processing and/or expression of the gene product. For 
example, mutations may be introduced using techniques which are well known in the art, 
eg, site-directed mutagenesis to insert new restriction sites, to alter glycosylation patterns or 
to change codon preference, for example. 

In one embodiment of the present invention, a gram-positive microorganism CPt, 
CP2 or CP3 polynucleotide may be ligated to a heterologous sequence to encode a fusion 
protein. A fusion protein may also be engineered to contain a cleavage site located 
between the cysteine protease nucleotide sequence and the heterologous protein 
sequence, so that the cysteine protease may be cleaved and purified away from the 
heterologous moiety. 

IV. Vector Sequences 

Expression vectors used in expressing the cysteine proteases of the present 
invention in gram-positive microorganisms comprise at least one promoter associated with a 
cysteine protease selected from the group consisting of CP1, CP2 and CP3, which promoter 
is functional in the host cell. In one embodiment of the present invention, the promoter is 
the wild-type promoter for the selected cysteine protease and in another embodiment of the 
present invention, the promoter is heterologous to the cysteine protease, but still functional- 
in the host cell. In one preferred embodiment of the present invention, nucleic acid 
encoding the cysteine protease is stably integrated into the microorganism genome. 

in a preferred embodiment, the expression vector contains a multiple cloning site 
cassette which preferably comprises at least one restriction endonuciease site unique to the 
vector, to facilitate ease of nucleic acid manipulation. In a preferred embodiment, the vector 
also comprises one or more selectable markers. As used herein, the term selectable 
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marker refers to a gene capable of expression in the gram-positive host which allows for 
ease of selection of those hosts containing the vector. Examples of such selectable 
markers include but are not limited to antibiotics, such as, erythromycin, actinomycin, 
chloramphenicol and tetracycline. 

V, Transformation 

A variety of host cells can be used for the production of CP1 , 
CP2 and CP3 including bacterial, fungal, mammalian and insects cells. General 
transformation procedures are taught in Current Protocols In Molecular Biology (vol. 1, 
edited by Ausubel et a!., John Wiley & Sons, Inc. 1987, Chapter 9) and include calcium 
phosphate methods, transformation using DEAE-Dextran and electroporation. Plant 
transformation methods are taught in Rodriquez (WO 95/14099, published 26 May 1995). 

!n a preferred embodiment, the host cell is a gram-positive microorganism and in 
another preferred embodiment, the host cell is Bacillus, in one embodiment of the present 
invention, nucleic acid encoding one or more cysteine protease(s) of the present invention is 
introduced into a host cell via an expression vector capable of replicating within the Bacillus 
host cell. Suitable replicating ptasmids for Bacillus are described in Molecular Biological 
Methods for Bacillus, Ed. Harwood and Cutting, John Wiley & Sons, 1990, hereby expressly 
incorporated by reference; see chapter 3 on plasmids. Suitable replicating plasmids for B. 
subtilis are listed on page 92. 

In another embodiment, nucleic acid encoding a cysteine protease(s) of the present 
invention is stably integrated into the microorganism genome. Preferred host cells are 
gram-positive host cells. Another preferred host is Bacillus. Another preferred host is 
Bacillus subtilis. Several strategies have been described in the literature for the direct 
cloning of DNA in Bacillus. Plasmid marker rescue transformation involves the uptake of a 
donor plasmid by competent cells carrying a partially homologous resident plasmid 
(Contente et aL, Plasmid 2:555*571 (1979); Haima et al., Mol. Gen. Genet. 223:185-191 
(1990); Weinrauch et al., J. Bacteriol. 154(3):1077-1087 (1983); and Weinrauch et al., J. 
Bacteriol. 169(3):1205-1211 (1987)). The incoming donor plasmid recombines with the 
homologous region of the resident "helper" plasmid in a process that mimics chromosomal 
transformation. 

Transformation by protoplast transformation is described for 8. subtilis in Chang and 
Cohen, (1979) Mol. Gen. Genet 168:111-115; for B.megaterium in Vorobjeva et al., (1980) 
FEMS Microbiol. Letters 7:261-263; for B. amyloliquefaciens in Smith et al., (1986) Appl. 
and Env. Microbiol. 51:634; for B.thuringiensis in Fisher et al., (1981) Arch. Microbiol. 
139:213-217; for B.sphaericus in McDonald (1984) J. Gen. Microbiol. 130:203; and B. larvae 
in Bakhiet et al., (1985) 49:577. Mann et al., (1986, Current Microbiol. 13:131-135) report 
on transformation of Bacillus protoplasts and Holubova, (1985) Folia Microbiol. 30:97) 
disclose methods for introducing DNA into protoplasts using DNA containing liposomes. 
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VI. Identification of Transformants ' ~ " - - ■ - ■ _ . 

Whether a host cell has been transformed with a mutated or a naturally occurring 
gene encoding a gram-positive CP1, CP2 or CP3, detection of the presence/absence of 
marker gene expression can suggests whether the gene of interest is present However, its 
expression should be confirmed. For example, if the nucleic acid encoding a cysteine 
protease is inserted within a marker gene sequence, recombinant cells containing the insert 
can be identified by the absence of marker gene function. Alternatively, a marker gene can 
be placed in tandem with nucleic acid encoding the cysteine protease under the control of a 
single promoter. Expression of the marker gene in response to induction or selection 
usually indicates expression of the cysteine protease as well. 

Alternatively, host cells which contain the coding sequence for a cysteine protease 
and express the protein may be identified by a variety of procedures known to those of skiii 
in the art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA 
hybridization and protein bioassay or immunoassay techniques which include membrane- 
based, solution-based, or chip-based technologies for the detection and/or quantification of 
the nucleic acid or protein. 

The presence of the cysteine polynucleotide sequence can be detected by DNA- 
DNA or DNA-RNA hybridization or amplification using probes, portions or fragments of 
B.subtifis CP1 , CP2 or CP3. 

VII. Assay of Protease Activity 

There are various assays known to those of skill in the art for detecting and 
measuring protease activity. There are assays based upon the release of acid-soluble 
peptides from casein or hemoglobin measured as absorbance at 280 nm or colorimetrically 
using the Folin method (Bergmeyer, et al., 1984, Methods of Enzymatic Analysis vol. 5, 
Peptidases, Proteinases and their Inhibitors, Verlag Chemie, Weinheim). Other assays 
involve the solubilization of chromogenic substrates (Ward, 1983, Proteinases, in Microbial 
Enzymes and Biotechnology (W.M. Fogarty, ed.), Applied Science, London, pp. 251-317). 

VIII. Secretion of Recombinant Proteins 

Means for determining the levels of secretion of a heterologous or homologous 
protein in a gram-positive host cell and detecting secreted proteins include, using either 
polyclonal or monoclonal antibodies specific for the protein. Examples include enzyme- 
iinked immunosorbent assay (ELISA), radioimmunoassay (RIA) and fluorescent activated 
cell sorting (FACS). These and other assays are described, among other places, in 
Hampton R et al (1990, Serological Methods, a Laboratory Manual, APS Press, St Paul MN) 
and Maddox DE et al (1983, J Exp Med 158:1211). 

A wide variety of labels and conjugation techniques are known by those skilled in the 
art and can be used in various nucleic and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting specific polynucleotide sequences include 
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oligoiabeling, nick translation, end-labeling or PCR amplification using a labeled nucleotide, _ 
Alternatively, the nucleotide sequence, or any portion of it, may be cloned into a vector for 
the production of an mRNA probe. Such vectors are known in the art, are commercially 
available, and may be used to synthesize RNA probes in vitro by addition of an appropriate 
RNA polymerase such as T7, T3 or SP6 and labeled nucleotides. 

A number of companies such as Pharmacia Biotech (Piscataway NJ), Promega 
(Madison Wl), and US Biochemical Corp (Cleveland OH) supply commercial kits and 
protocols for these procedures. Suitable reporter molecules or labels include those 
radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents as well as 
substrates, cofactors, inhibitors, magnetic particles and the like. Patents teaching the use 
of such labels include US Patents 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 
4,275,149 and 4,366,241. Also, recombinant immunoglobulins may be produced as shown 
in US Patent No, 4,816,567 and incorporated herein by reference. 

IX. Purification of Proteins 

Gram positive host cells transformed with polynucleotide sequences encoding 
heterologous or homologous protein may be cultured under conditions suitable for the 
expression and recovery of the encoded protein from cell culture. The protein produced by 
a recombinant gram-positive host cell comprising a mutation or deletion of the cysteine 
protease activity will be secreted into the culture media. Other recombinant constructions 
may join the heterologous or homologous polynucleotide sequences to nucleotide sequence 
encoding a polypeptide domain which will facilitate purification of soluble proteins (Krotl DJ 
et al (1993) DNA Cell Biol 12:441-53). 

Such purification facilitating domains include, but are not limited to, metal chelating 
peptides such as histidtne-tryptophan modules that allow purification on immobilized metals 
(Porath J (1992) Protein Expr Purif 3:263-281), protein A domains that allow purification on 
immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity 
purification system (Immunex Corp, Seattle WA). The inclusion of a cleavable linker 
sequence such as Factor XA or enterokinase (Invitrogen, San Diego CA) between the 
purification domain and the heterologous protein can be used to facilitate purification. 

X. Uses of The Present Invention 

CP1, CP2 and CP3 and Genetically Engineered Host Cells 

The present invention provides genetically engineered host cells comprising 
preferably non-revertable mutations or deletions in the naturally occurring gene encoding 
CP1, CP2 or CP3 such that the proteolytic activity is diminished or deleted altogether. The 
host cell may contain additional protease deletions, such as deletions of the mature subtilisn 
protease and/or mature neutral protease disclosed in United States Patent No. 5,264,366. 
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In a preferred embodiment; the host-cell is further- genetically, engineered to produce 
a desired protein or polypeptide. In a preferred embodiment the host cell is a Bacillus. In 
another preferred embodiment, the host cell is a Bacillus subtilis. 

In an alternative embodiment, a host cell is genetically engineered to produce a 
gram-positive CP1, CP2 or CP3. In a preferred embodiment, the host cell is grown under 
targe scale fermentation conditions, the CP1, CP2 or CP3 is isolated and/or purified and 
used in cleaning compositions such as detergents. Detergent formulations are disclosed in 
WO 95/10615. A cysteine protease of the present invention can be useful in formulating 
various cleaning compositions. A number of known compounds are suitable surfactants 
useful in compositions comprising the cysteine protease of the invention. These include 
nonionic, anionic, cationic, anionic or zwitterionic detergents, as disclosed in US 4,404,128 
and US 4,261,868. A suitable detergent formulation is that described in Example 7 of US 
Patent 5,204,015. The art is familiar with the different formulations which can be used as 
cleaning compositions. In addition, a cysteine protease of the present invention can be 
used, for example, in bar or liquid soap applications, dishcare formulations, contact lens 
cleaning solutions or products, peptide hydrolysis, waste treatment, textile applications, as 
fusion-cleavage enzymes in protein production, etc. A cysteine protease may provide 
enhanced performance in a detergent composition (as compared to another detergent 
protease). As used herein, enhanced performance in a detergent is defined as increasing 
cleaning of certain enzyme sensitive stains such as grass or blood, as determined by usual 
evaluation after a standard wash cycle. 

A cysteine protease of the present invention can be formulated into known 
powdered and liquid detergents having pH between 6.5 and 12.0 at levels of about .01 to 
about 5% (preferably .1% to .5%) by weight. These detergent cleaning compositions can 
also include other enzymes such as known proteases, amylases, cellulases, lipases or 
endogiycosidases, as well as builders and stabilizers. 

The addition of a cysteine protease to conventional cleaning compositions does not 
create any special use limitation. In other words, any temperature and pH suitable for the 
detergent is also suitable for the present compositions as long as the pH is within the above 
range, and the temperature is below the described cysteine protease denaturing 
temperature. In addition, a cysteine protease can be used in a cleaning composition without 
detergents, again either alone or in combination with builders and stabilizers. 

One aspect of the invention is a composition for the treatment of a textile that 
includes a cysteine protease of the present invention. The composition can be used to treat 
for example silk or wool as described in publications such as RD 216,034; EP 134,267; US 
4,533,359; and EP 344,259. 
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Proteases can be included in animal feed such as.part of animal feed additives as 
described in, for example, US 5,612,055; US 5,314,692; and US 5,147,642. 

CP1. CP2 and CP3 Polynucleotides 

A B.subtlis polynucleotide, or any part thereof, provides the basis for detecting the 

presence of gram-positive microorganism polynucleotide homologs through hybridization 

techniques and PCR technology. 

Accordingly, one aspect of the present invention is to provide for nucleic acid 

hybridization and PCR probes which can be used to detect polynucleotide sequences, 

including genomic and cDNA sequences, encoding gram-positive CP1, CP2 or CP3 or 

portions thereof, 

The manner and method of carrying out the present invention may be more fully 
understood by those of skill in the art by reference to the following examples, which 
examples are not intended in any manner to limit the scope of the present invention or of 
the claims directed thereto 

Example I 
Preparation of a Genomic library 

The following example illustrates the preparation of a Bacillus genomic library. 

Genomic DNA from Bacillus cells is prepared as taught in Current Protocols In 
Molecular Biology vol. 1, edited by Ausubel et a!., John Wiley & Sons, Inc. 1987, chapter 2. 
4.1. Generally, Bacillus cells from a saturated liquid culture are lysed and the proteins 
removed by digestion with proteinase K. Cell wail debris, polysaccharides, and remaining 
proteins are removed by selective precipitation with CTAB, and high molecular weight 
genomic DNA is recovered from the resulting supernatant by isopropanol precipitation. If 
exceptionally clean genomic DNA is desired, an additional step of purifying the Bacillus 
genomic DNA on a cesium chloride gradient is added. 

After obtaining purified genomic DNA, the DNA is subjected to Sau3A digestion. 
Sau3A recognizes the 4 base pair site GATC and generates fragments compatible with 
several convenient phage lambda and cosmid vectors. The DNA is subjected to partial 
digestion to increase the chance of obtaining random fragments. 

The partially digested Bacillus genomic DNA is subjected to size fractionation on a 
1% agarose gel prior to cloning into a vector. Alternatively, size fractionation on a sucrose 
gradient can be used. The genomic DNA obtained from the size fractionation step is 
purified away from the agarose and ligated into a cloning vector appropriate for use in a 
host cell and transformed into the host cell. 
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Example il 

Detection of qram-postive microorganisms 

The following example describes the detection of gram-positive microorganism CP1. 
The same procedures can be used to detect CP2 and CP3. 

DNA derived from a gram-positive microorganism is prepared according to the 
methods disclosed in Current Protocols in Molecular Biology, Chap. 2 or 3. The nucleic acid 
is subjected to hybridization and/or PCR amplification with a probe or primer derived from 
CP1. A preferred probe comprises the nucleic acid section containing the conserved motif 
GXCWAF. 

The nucleic acid probe is labeled by combining 50 pmoi of the nucleic acid and 250 

mCi of [gamma 32 P] adenosine triphosphate (Amersham, Chicago IL) and T4 

polynucleotide kinase (DuPont NEN®, Boston MA). The labeled probe is purified with 

Sephadex G-25 super fine resin column (Pharmacia). A portion containing 10 7 counts per 

minute of each is used in a typical membrane based hybridization analysis of nucleic acid 

sample of either genomic or cDNA origin. 

The DNA sample which has been subjected to restriction endonuclease digestion is 

fractionated on a 0.7 percent agarose gel and transferred to nylon membranes (Nytran Pius, 

Schleicher & Schuel!, Durham NH). Hybridization is carried out for 16 hours at 40 degrees 

C. To remove nonspecific signals, blots are sequentially washed at room temperature 

under increasingly stringent conditions up to 0.1 x saline sodium citrate and 0.5% sodium 

dodecyi sulfate. The blots are exposed to film for several hours, the film developed and 

hybridization patterns are compared visually to detect polynucleotide homologs of B.subtilis 

CP1. The homologs are subjected to confirmatory nucleic acid sequencing. Methods for 

nucleic acid sequencing are well known in the art. Conventional enzymatic methods employ 

DNA polymerase Klenow fragment, SEQUENASE® (US Biochemical Corp, Cleveland, OH) 

or Taq polymerase to extend DNA chains from an oligonucleotide primer annealed to the 

DNA template of interest. 

Various other examples and modifications of the foregoing description and examples 

will be apparent to a person skilled in the art after reading the disclosure without departing 

from the spirit and scope of the invention, and it is intended that ail such examples or 

odifications be included within the scope of the appended claims. All publications and 

patents referenced herein are hereby incorporated by reference in their entirety. 
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CLAIMS 

1. A gram-positive microorganism having a mutation or deletion of part or all of the 
gene encoding CP1 said mutation or deletion resulting in the inactivation of the CP1 
proteolytic activity. 

2. A gram-positive microorganism having a mutation or deletion of part or all of the 
gene encoding CP2 said mutation or deletion resulting in the inactivation of the CP2 
proteolytic activity. 

3. A gram-positive microorganism having a mutation or deletion of part or all of the 
gene encoding CP3 said mutation or deletion resulting in the inactivation of the CP3 
proteolytic activity, 

4. The gram-positive microorganism according to Claims 1, 2 or 3 that is a member 
of the family Bacillus. 

5. The microorganism according to Claim 4 wherein the member is selected from 
the group consisting of B, subtilis, B. licheniformis, B. lentus, B. brevis, B. 
stearothermophilus, B. alkalophilus, B. amyfoliquefaciens, B. coagulans, B. circulans, B. 
lautus and Bacillus thuringiensis. 

6. The microorganism of Claim 1, 2 or 3 wherein said microorganism is capable of 
expressing a heterologous protein. 

7. The microorganism of Claim 6 wherein said heterologous protein is selected from 
the group consisting of hormone, enzyme, growth factor and cytokine. 

8. The microorganism of Claim 7 wherein said heterologous protein is an enzyme. 

9. The microorganism of Claim 8 wherein said enzyme is selected from the group 
consisting of a proteases, carbohydrases, and lipases; isomerases such as racemases, 
epimerases, tautomerases, or mutases; transferases, kinases and phophatases. 

10. A cleaning composition comprising at least one cysteine protease selected from 
the group consisting of CP1 r CP2 and CP3. 




i 



f 



* 



WO 99/04016 PCT/US98/14529 

18 

11. An expression vector comprising nucleic acid encoding a cysteine protease 
selected from the group consisting of CP1, CP2 and CP3. 



12. A host celt comprising an expression vector according to Claim 1 1 . 

13. A method for the production of a heterologous protein in a Bacillus host cell 
comprising the steps of 

(a) obtaining a Bacillus host cell comprising nucleic acid encoding said 
heterologous protein wherein said host cell contains a mutation or deletion in at least 
one of the genes encoding cysteine protease 1, cysteine protease 2 and cysteine 
protease 3; and 

(b) growing said Bacillus host cell under conditions suitable for the expression 
of said heterologous protein. 

14. The method of Claim 13 wherein said Bacillus cell is selected from the group 
consisting of Bacillus subtilis, B. licheniformis, B. lentus, B. brevis, B. stearothermophilus, B. 
alkalophilus f B. amyloliquefaciens, B. coagufans, B. circulans t B. lautus and Bacillus 
thuringiensis. 

15. The method of Claim 13 wherein said Bacillus host cell further comprises a 
mutation or deletion in at least one of the genes encoding apr, npr, epr, wpr and mrp. 

16. A gram-positive microorganism having at mutation or deletion in at least one of 
the genes encoding a cysteine protease selected from the group consisting of CP1, CP2 
and CP3. 



17. The microorganism of Claim 16 further comprising a mutation or deletion in at 
least one of the genes encoding apr, npr, epr, wpr and mrp. 
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10 30 
atgacgactgaaccgttatttttcaagcctgttttcaaagaaagaatt 
MTTEPLFFKPVFKERI 

50 70 90 

tggggcgggaccgctttagctgattttggctataccattccgtcacaa 

WGGTALADFGYTIPSQ 

110 130 
cgaacaggggagtgctgggcttttgccgcgcatcaaaatggtcaaagc 
RTGECWAFAAHQNGQS 

150 170 190 

gttgttcaaaacggaatgtataaggggttcacgctcagcgaattatgg 
VVQNGMYKGFTLSELW 

210 230 
gaacatcacagacatttattcggacagcttgaaggggaccgtttccct 
EHHRHLFGQLEGDRFP 

250 270 2 

ctgcttacaaaaatattagatgctgaccaggacttatctgttcaggtg 
LLTKI LDADQDLSVQV 

90 310 330 

catccgaatgatgaatatgccaacatacatgaaaacggtgagcttgga 

HPNDEYANIHENGELG 

350 370 
aaaacagaatgctggtacattattgattgccaaaaagatgccgagatt 
KTECWYI IDCQKDAEI 

390 410 430 

atttatggccacaatgcaacaacaaaggaagaactaactaccatgata 
IYGHNATTKEELTTMI 

450 470 
gagcgtggagaatgggatgagctcttgcgccgtgtaaaggtaaagccg 
ERGEWDELLRRVKVKP 

490 510 5 

ggggattttttctatgtgccaagcggtactgttcatgcgattggaaaa 
GDFFYVPSGTVHAIGK 

30 550 570 

ggaattcttgctttggagacgcagcagaactcagacacaacctacaga 

GILALETQQNSDTTYR 
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10 30 
atgacgcatccattatttttagagcctgtctttaaagaaagactatgg 
MTH PLFLEPVFKERLW 

50 70 90 

ggagggacgaagcttcgtgacgcttttggctacgcaataccctcacaa 

GGTKLRDAFGYAIPSQ 

110 130 
aaaacaggtgagtgctgggccgtttctgcacatgcccatggctcgtcg 
KTG ECWAVSAHAHG S S 

150 170 190 

tctgtaaaaaatggcccgctggcaggaaagacacttgatcaagtatgg 
SVKNGPLAGKTLDQVW 

210 230 
aaagatcatccagagatattcgggtttccggatggtaaggtgtttccg 
KDH PEIFGFPDGKVFP 

250 270 2 

ctgctggtaaagctgctggacgccaatatggatctctccgtgcaagtc 
LLVKLLDANMDLSVQV 

90 310 330 

catcctgatgatgattatgcaaaactgcacgaaaatggcgaccttggt 

HPDDDYAKLHENGDLG 

350 370 
aaaacggagtgctggtatatcattgattgcaaagatgacgccgaacta 
KTE CWYI IDCKDDAEL 

390 410 430 

attttgggacatcatgcaagcacaaaggaagagttcaaacaacgaata 
ILGHHASTKEEFKQRI 

450 470 
gaaagcggtgattggaacgggctgctgaggcgaatcaaaatcaagcca 
ESGDWNGLLRRIKIKP 

490 510 5 

ggagatttcttttatgtgccaagcggtacactccatgctttatgtaag 
GDF FYVP SGTLHALCK 

30 550 570 

ggaacccttgtccttgaaatccagcaaaactctgatacaacatatcgc 

GTLVLEIQQNSDTTYR 
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61° 

gtatacgattatgaccgctgtaatgaccaggg Q R R T L 

670 

630 ^!macoataccgcatatcgataaa 

catatagaaaaagccatggaagtcataacga p h I D K 

hI EKAMEV-l 

710 

690 ^^^acactgagatcattgtt 
aaaagaagttggtaacgccg y 



gtgcatacaccggaagtaaaay * Q N A E I 



750 



730 . ^ttctcagtgtacaaatggaagattagcggc 

tatgtgcaatcagattatttctcagtg^ R w K x s G 

y v Q s d y 

810 

70 , .tttccttcatltcaaacctatttgctggggagtgttctg 
cgagctgcttttcctcca tylLGSV^ 
rAAFPSYU 

* 910 

890 

870 t ..^tacctgcgcattttggagaatttacaaea 

g ca g gct=acac«tatt=tgc=tgog H p q j; F T I 

h G S H r 

ga a g9 aaca tgt| aa"tc- grts ct=atcc t 
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10 30 
atgacgcaatcaccgatttttctaacgcctgtgtttaaagaaaaaatc 
MTQSPIFLTPVFKEKI 

50 70 90 

tggggcggaaccgctttacgagatagatttggatacagtattccttca 

WGGTALRDRFGYSIPS 

110 130 
gaatcaacgggggaatgctgggccatttccgctcatccaaaaggaccg 
ESTGECWAISAHPKGP 

150 170 190 

agcactgttgcaaatggcccgtataaaggaaagacattgatcgagctt 
S TVANGFYKGKTLIEL 

210 230 
tgggaagagcaccgtgaagtattcggcggcgtagagggggatcggttt 
WEEHREVFGGVE GDRF 

250 270 2 

ccgcttctgacaaagctgctggatgtgaaggaagatacgtcaattaaa 
PLLTKLLDVKEDTSIK 

90 310 330 

gttcaccctgatgattactatgccggagaaaacgaagagggagaactc 

VHPDDYYAGENEEGEL 

350 370 
ggcaagacggaatgctggtacattatcgactgtaaggaaaacgcagaa 
GKTECWYIIDCKENAE 

390 410 430 

atcatttacgggcatacggcccgctcaaaaaccgaacttgtcacaatg 
I I YGHTARSKTELVTM 

450 470 
atcaacagcggtgactgggagggcctgctgcgaagaatcaaaattaaa 
INSGDWEGLLRRIKIK 

490 510 5 

ccgggtgatttctattatgtgccgagcggaacgctgcacgcattgtgc 
PGDFYYVPSGTLHALC 

30 550 570 

aagggggcccttgttttagagactcagcaaaattcagatgccacatac 

KGALVLETQQNSDATY 
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590 610 
cgggtgtacgattatgaccgtcttgatagcaacggaagtccgagagag 
RVYDYDRLDSNGS PRE 

630 650 670 

cttcattttgccaaagcggtcaatgccgccacggttccccatgtggac 
LHFAKAVNAATVPHVD 

690 710 
gggtatatagatgaatcgacagaatcaagaaaaggaataaccattaaa 
GYIDESTESRKGITIK 

730 750 7 

acatttgtccaaggggaatatttttcggtttataaatgggacatcaat 
TFVQGEYFSVYKWDIN 

70 790 810 

ggcgaagctgaaatggctcaggatgaatcctttctgatttgcagcgtg 

GEAEMAQDESFLICSV 

830 850 
atagaaggaagcggtttgctcaagtatgaggacaaaacatgtccgctc 
IEGSGLLKYEDKTCPL 

870 890 910 

aaaaaaggtgatcactttattttgccggctcaaatgcccgattttacg 
KKGDHFILPAQMPDFT 

930 

ataaaaggaacttgtacccttatcgtgtctcatatt 
IKGTCTLIVSHI 
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